Recent Progress in the CUHK Dysarthric Speech Recognition System

نویسندگان

چکیده

Despite the rapid progress of automatic speech recognition (ASR) technologies in past few decades, disordered remains a highly challenging task to date. Disordered presents wide spectrum challenges current data intensive deep neural networks (DNNs) based ASR that predominantly target normal speech. This paper recent research efforts at Chinese University Hong Kong (CUHK) improve performance systems on largest publicly available UASpeech dysarthric corpus. A set novel modelling techniques including architectural search, augmentation using spectra-temporal perturbation, model speaker adaptation and cross-domain generation visual features within an audio-visual (AVSR) system framework were employed address above challenges. The combination these produced lowest published word error rate (WER) 25.21% test 16 speakers, overall WER reduction 5.4% absolute (17.6% relative) over CUHK 2018 featuring 6-way DNN cross out-of-domain trained systems. Bayesian further allows individual speakers be performed as little 3.06 seconds efficacy demonstrated CUDYS Cantonese task.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recent Progress in the Sphinx Speech Recognition System

This paper describes recent improvements in the SPHINX Speech Recognition System. These enhancements include function-phrase modeling, between-word coarticulation modeling, and corrective training. On the DARPA resource management task, SPHINX attained a speaker-independent word accuracy of 96% with a grammar (perplexity 60), and 82% without grammar (perplexity 997).

متن کامل

Speech Recognition Technology for Dysarthric Speech

The initial results of investigations into the use of current commercial automatic speech recognition (ASR) software by people with speech disability (dysarthria) is presented, together with a brief summary of the history of the development of ASR and its applications for the disabled. Results confirm the viability of dysarthric use, identify areas of further investigation for improved recognit...

متن کامل

Sentence modality recognition in dysarthric speech

متن کامل

Recent Progress in Robust Vocabulary-Independent Speech Recognition

This paper reports recent efforts to improve the performance of CMU's robust vocabulary-independent (VI) speech recognition systems on the DARPA speaker-independent resource management task. The improvements are evaluated on 320 sentences that randomly selected from the DARPA June 88, February 89 and October 89 test sets. Our first improvement involves more detailed acoustic modeling. We incorp...

متن کامل

Recent Progress in Corpus-Based Spontaneous Speech Recognition

This paper overviews recent progress in the development of corpus-based spontaneous speech recognition technology. Although speech is in almost any situation spontaneous, recognition of spontaneous speech is an area which has only recently emerged in the field of automatic speech recognition. Broadening the application of speech recognition depends crucially on raising recognition performance f...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE/ACM transactions on audio, speech, and language processing

سال: 2022

ISSN: ['2329-9304', '2329-9290']

DOI: https://doi.org/10.1109/taslp.2021.3091805